Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 233
Filtrar
1.
Proteomics ; 24(8): e2300144, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38629965

RESUMO

In protein-RNA cross-linking mass spectrometry, UV or chemical cross-linking introduces stable bonds between amino acids and nucleic acids in protein-RNA complexes that are then analyzed and detected in mass spectra. This analytical tool delivers valuable information about RNA-protein interactions and RNA docking sites in proteins, both in vitro and in vivo. The identification of cross-linked peptides with oligonucleotides of different length leads to a combinatorial increase in search space. We demonstrate that the peptide retention time prediction tasks can be transferred to the task of cross-linked peptide retention time prediction using a simple amino acid composition encoding, yielding improved identification rates when the prediction error is included in rescoring. For the more challenging task of including fragment intensity prediction of cross-linked peptides in the rescoring, we obtain, on average, a similar improvement. Further improvement in the encoding and fine-tuning of retention time and intensity prediction models might lead to further gains, and merit further research.


Assuntos
Ácidos Nucleicos , RNA , Aminoácidos , Espectrometria de Massas , Peptídeos
2.
Bioinformatics ; 40(4)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38498849

RESUMO

MOTIVATION: Cross-linking mass spectrometry has made remarkable advancements in the high-throughput characterization of protein structures and interactions. The resulting pairs of cross-linked peptides typically require geometric assessment and validation, given the availability of their corresponding structures. RESULTS: CLAUDIO (Cross-linking Analysis Using Distances and Overlaps) is an open-source software tool designed for the automated analysis and validation of different varieties of large-scale cross-linking experiments. Many of the otherwise manual processes for structural validation (i.e. structure retrieval and mapping) are performed fully automatically to simplify and accelerate the data interpretation process. In addition, CLAUDIO has the ability to remap intra-protein links as inter-protein links and discover evidence for homo-multimers. AVAILABILITY AND IMPLEMENTATION: CLAUDIO is available as open-source software under the MIT license at https://github.com/KohlbacherLab/CLAUDIO.


Assuntos
Peptídeos , Software , Peptídeos/química , Espectrometria de Massas , Reagentes de Ligações Cruzadas/química
3.
Neurol Res Pract ; 6(1): 15, 2024 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-38449051

RESUMO

INTRODUCTION: In Multiple Sclerosis (MS), patients´ characteristics and (bio)markers that reliably predict the individual disease prognosis at disease onset are lacking. Cohort studies allow a close follow-up of MS histories and a thorough phenotyping of patients. Therefore, a multicenter cohort study was initiated to implement a wide spectrum of data and (bio)markers in newly diagnosed patients. METHODS: ProVal-MS (Prospective study to validate a multidimensional decision score that predicts treatment outcome at 24 months in untreated patients with clinically isolated syndrome or early Relapsing-Remitting-MS) is a prospective cohort study in patients with clinically isolated syndrome (CIS) or Relapsing-Remitting (RR)-MS (McDonald 2017 criteria), diagnosed within the last two years, conducted at five academic centers in Southern Germany. The collection of clinical, laboratory, imaging, and paraclinical data as well as biosamples is harmonized across centers. The primary goal is to validate (discrimination and calibration) the previously published DIFUTURE MS-Treatment Decision score (MS-TDS). The score supports clinical decision-making regarding the options of early (within 6 months after study baseline) platform medication (Interferon beta, glatiramer acetate, dimethyl/diroximel fumarate, teriflunomide), or no immediate treatment (> 6 months after baseline) of patients with early RR-MS and CIS by predicting the probability of new or enlarging lesions in cerebral magnetic resonance images (MRIs) between 6 and 24 months. Further objectives are refining the MS-TDS score and providing data to identify new markers reflecting disease course and severity. The project also provides a technical evaluation of the ProVal-MS cohort within the IT-infrastructure of the DIFUTURE consortium (Data Integration for Future Medicine) and assesses the efficacy of the data sharing techniques developed. PERSPECTIVE: Clinical cohorts provide the infrastructure to discover and to validate relevant disease-specific findings. A successful validation of the MS-TDS will add a new clinical decision tool to the armamentarium of practicing MS neurologists from which newly diagnosed MS patients may take advantage. Trial registration ProVal-MS has been registered in the German Clinical Trials Register, `Deutsches Register Klinischer Studien` (DRKS)-ID: DRKS00014034, date of registration: 21 December 2018; https://drks.de/search/en/trial/DRKS00014034.

5.
Proteomics ; 24(3-4): e2300068, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37997224

RESUMO

Top-down proteomics (TDP) directly analyzes intact proteins and thus provides more comprehensive qualitative and quantitative proteoform-level information than conventional bottom-up proteomics (BUP) that relies on digested peptides and protein inference. While significant advancements have been made in TDP in sample preparation, separation, instrumentation, and data analysis, reliable and reproducible data analysis still remains one of the major bottlenecks in TDP. A key step for robust data analysis is the establishment of an objective estimation of proteoform-level false discovery rate (FDR) in proteoform identification. The most widely used FDR estimation scheme is based on the target-decoy approach (TDA), which has primarily been established for BUP. We present evidence that the TDA-based FDR estimation may not work at the proteoform-level due to an overlooked factor, namely the erroneous deconvolution of precursor masses, which leads to incorrect FDR estimation. We argue that the conventional TDA-based FDR in proteoform identification is in fact protein-level FDR rather than proteoform-level FDR unless precursor deconvolution error rate is taken into account. To address this issue, we propose a formula to correct for proteoform-level FDR bias by combining TDA-based FDR and precursor deconvolution error rate.


Assuntos
Peptídeos , Proteômica , Proteínas de Ligação a DNA
6.
J Cheminform ; 15(1): 52, 2023 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-37173725

RESUMO

Metabolomics experiments generate highly complex datasets, which are time and work-intensive, sometimes even error-prone if inspected manually. Therefore, new methods for automated, fast, reproducible, and accurate data processing and dereplication are required. Here, we present UmetaFlow, a computational workflow for untargeted metabolomics that combines algorithms for data pre-processing, spectral matching, molecular formula and structural predictions, and an integration to the GNPS workflows Feature-Based Molecular Networking and Ion Identity Molecular Networking for downstream analysis. UmetaFlow is implemented as a Snakemake workflow, making it easy to use, scalable, and reproducible. For more interactive computing, visualization, as well as development, the workflow is also implemented in Jupyter notebooks using the Python programming language and a set of Python bindings to the OpenMS algorithms (pyOpenMS). Finally, UmetaFlow is also offered as a web-based Graphical User Interface for parameter optimization and processing of smaller-sized datasets. UmetaFlow was validated with in-house LC-MS/MS datasets of actinomycetes producing known secondary metabolites, as well as commercial standards, and it detected all expected features and accurately annotated 76% of the molecular formulas and 65% of the structures. As a more generic validation, the publicly available MTBLS733 and MTBLS736 datasets were used for benchmarking, and UmetaFlow detected more than 90% of all ground truth features and performed exceptionally well in quantification and discriminating marker selection. We anticipate that UmetaFlow will provide a useful platform for the interpretation of large metabolomics datasets.

7.
BMC Bioinformatics ; 24(1): 88, 2023 Mar 08.
Artigo em Inglês | MEDLINE | ID: mdl-36890446

RESUMO

BACKGROUND: Personalized oncology represents a shift in cancer treatment from conventional methods to target specific therapies where the decisions are made based on the patient specific tumor profile. Selection of the optimal therapy relies on a complex interdisciplinary analysis and interpretation of these variants by experts in molecular tumor boards. With up to hundreds of somatic variants identified in a tumor, this process requires visual analytics tools to guide and accelerate the annotation process. RESULTS: The Personal Cancer Network Explorer (PeCaX) is a visual analytics tool supporting the efficient annotation, navigation, and interpretation of somatic genomic variants through functional annotation, drug target annotation, and visual interpretation within the context of biological networks. Starting with somatic variants in a VCF file, PeCaX enables users to explore these variants through a web-based graphical user interface. The most protruding feature of PeCaX is the combination of clinical variant annotation and gene-drug networks with an interactive visualization. This reduces the time and effort the user needs to invest to get to a treatment suggestion and helps to generate new hypotheses. PeCaX is being provided as a platform-independent containerized software package for local or institution-wide deployment. PeCaX is available for download at https://github.com/KohlbacherLab/PeCaX-docker .


Assuntos
Neoplasias , Software , Humanos , Genômica/métodos , Neoplasias/genética , Oncologia
8.
Neurooncol Adv ; 5(1): vdad012, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36915613

RESUMO

Background: The clinical utility of molecular profiling and targeted therapies for neuro-oncology patients outside of clinical trials is not established. We aimed at investigating feasibility and clinical utility of molecular profiling and targeted therapy in adult patients with advanced tumors in the nervous system within a prospective observational study. Methods: molecular tumor board (MTB)@ZPM (NCT03503149) is a prospective observational precision medicine study for patients with advanced tumors. After inclusion of patients, we performed comprehensive molecular profiling, formulated ranked biomarker-guided therapy recommendations based on consensus by the MTB, and collected prospective clinical outcome data. Results: Here, we present initial data of 661 adult patients with tumors of the nervous system enrolled by December 31, 2021. Of these, 408 patients were presented at the MTB. Molecular-instructed therapy recommendations could be made in 380/408 (93.1%) cases and were prioritized by evidence levels. Therapies were initiated in 86/380 (22.6%) cases until data cutoff. We observed a progression-free survival ratio >1.3 in 31.3% of patients. Conclusions: Our study supports the clinical utility of biomarker-guided therapies for neuro-oncology patients and indicates clinical benefit in a subset of patients. Our data might inform future clinical trials, translational studies, and even clinical care.

9.
Br J Cancer ; 128(9): 1777-1787, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-36823366

RESUMO

BACKGROUND: The immune peptidome of OPSCC has not previously been studied. Cancer-antigen specific vaccination may improve clinical outcome and efficacy of immune checkpoint inhibitors such as PD1/PD-L1 antibodies. METHODS: Mapping of the OPSCC HLA ligandome was performed by mass spectrometry (MS) based analysis of naturally presented HLA ligands isolated from tumour tissue samples (n = 40) using immunoaffinity purification. The cohort included 22 HPV-positive (primarily HPV-16) and 18 HPV-negative samples. A benign reference dataset comprised of the HLA ligandomes of benign haematological and tissue datasets was used to identify tumour-associated antigens. RESULTS: MS analysis led to the identification of naturally HLA-presented peptides in OPSCC tumour tissue. In total, 22,769 peptides from 9485 source proteins were detected on HLA class I. For HLA class II, 15,203 peptides from 4634 source proteins were discovered. By comparative profiling against the benign HLA ligandomic datasets, 29 OPSCC-associated HLA class I ligands covering 11 different HLA allotypes and nine HLA class II ligands were selected to create a peptide warehouse. CONCLUSION: Tumour-associated peptides are HLA-presented on the cell surfaces of OPSCCs. The established warehouse of OPSCC-associated peptides can be used for downstream immunogenicity testing and peptide-based immunotherapy in (semi)personalised strategies.


Assuntos
Antígenos HLA , Neoplasias Otorrinolaringológicas , Infecções por Papillomavirus , Carcinoma de Células Escamosas de Cabeça e Pescoço , Humanos , Infecções por Papillomavirus/imunologia , Peptídeos/imunologia , Vacinação , Neoplasias Otorrinolaringológicas/imunologia , Antígenos HLA/imunologia , Antígenos de Neoplasias/imunologia , Papillomavirus Humano 16 , Papillomavirus Humano 18
10.
BMC Neurol ; 23(1): 2, 2023 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-36597038

RESUMO

BACKGROUND: Although of high individual and socioeconomic relevance, a reliable prediction model for the prognosis of juvenile stroke (18-55 years) is missing. Therefore, the study presented in this protocol aims to prospectively validate the discriminatory power of a prediction score for the 3 months functional outcome after juvenile stroke or transient ischemic attack (TIA) that has been derived from an independent retrospective study using standard clinical workup data. METHODS: PREDICT-Juvenile-Stroke is a multi-centre (n = 4) prospective observational cohort study collecting standard clinical workup data and data on treatment success at 3 months after acute ischemic stroke or TIA that aims to validate a new prediction score for juvenile stroke. The prediction score has been developed upon single center retrospective analysis of 340 juvenile stroke patients. The score determines the patient's individual probability for treatment success defined by a modified Rankin Scale (mRS) 0-2 or return to pre-stroke baseline mRS 3 months after stroke or TIA. This probability will be compared to the observed clinical outcome at 3 months using the area under the receiver operating characteristic curve. The primary endpoint is to validate the clinical potential of the new prediction score for a favourable outcome 3 months after juvenile stroke or TIA. Secondary outcomes are to determine to what extent predictive factors in juvenile stroke or TIA patients differ from those in older patients and to determine the predictive accuracy of the juvenile stroke prediction score on other clinical and paraclinical endpoints. A minimum of 430 juvenile patients (< 55 years) with acute ischemic stroke or TIA, and the same number of older patients will be enrolled for the prospective validation study. DISCUSSION: The juvenile stroke prediction score has the potential to enable personalisation of counselling, provision of appropriate information regarding the prognosis and identification of patients who benefit from specific treatments. TRIAL REGISTRATION: The study has been registered at https://drks.de on March 31, 2022 ( DRKS00024407 ).


Assuntos
Ataque Isquêmico Transitório , AVC Isquêmico , Acidente Vascular Cerebral , Humanos , Adulto Jovem , Idoso , Ataque Isquêmico Transitório/diagnóstico , Ataque Isquêmico Transitório/epidemiologia , Ataque Isquêmico Transitório/complicações , AVC Isquêmico/complicações , Estudos Retrospectivos , Acidente Vascular Cerebral/diagnóstico , Acidente Vascular Cerebral/epidemiologia , Acidente Vascular Cerebral/complicações , Prognóstico , Valor Preditivo dos Testes , Estudos Observacionais como Assunto
11.
J Clin Med ; 11(19)2022 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-36233463

RESUMO

To identify potential genetic causes for Mayer-Rokitansky-Küster-Hauser syndrome (MRKH), we analyzed blood and rudimentary uterine tissue of 5 MRKH discordant monozygotic twin pairs. Assuming that a variant solely identified in the affected twin or affected tissue could cause the phenotype, we identified a mosaic variant in ACTR3B with high allele frequency in the affected tissue, low allele frequency in the blood of the affected twin, and almost absent in blood of the unaffected twin. Focusing on MRKH candidate genes, we detected a pathogenic variant in GREB1L in one twin pair and their unaffected mother showing a reduced phenotypic penetrance. Furthermore, two variants of unknown clinical significance in PAX8 and WNT9B were identified. In addition, we conducted transcriptome analysis of affected tissue and observed perturbations largely similar to those in sporadic cases. These shared transcriptional changes were enriched for terms associated with estrogen and its receptors pointing at a role of estrogen in MRKH pathology. Our genome sequencing approach of blood and uterine tissue of discordant twins is the most extensive study performed on twins discordant for MRKH so far. As no clear pathogenic differences were detected, research to evaluate other regulatory layers are required to better understand the complex etiology of MRKH.

12.
Nat Commun ; 13(1): 6212, 2022 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-36266287

RESUMO

Lysosomes are well-established as the main cellular organelles for the degradation of macromolecules and emerging as regulatory centers of metabolism. They are of crucial importance for cellular homeostasis, which is exemplified by a plethora of disorders related to alterations in lysosomal function. In this context, protein complexes play a decisive role, regulating not only metabolic lysosomal processes but also lysosome biogenesis, transport, and interaction with other organelles. Using cross-linking mass spectrometry, we analyze lysosomes and early endosomes. Based on the identification of 5376 cross-links, we investigate protein-protein interactions and structures of lysosome- and endosome-related proteins. In particular, we present evidence for a tetrameric assembly of the lysosomal hydrolase PPT1 and a heterodimeric structure of FLOT1/FLOT2 at lysosomes and early endosomes. For FLOT1-/FLOT2-positive early endosomes, we identify >300 putative cargo proteins and confirm eleven substrates for flotillin-dependent endocytosis, including the latrophilin family of adhesion G protein-coupled receptors.


Assuntos
Endossomos , Lisossomos , Endossomos/metabolismo , Lisossomos/metabolismo , Proteínas de Membrana/metabolismo , Hidrolases/metabolismo
13.
Nat Commun ; 13(1): 4407, 2022 07 29.
Artigo em Inglês | MEDLINE | ID: mdl-35906205

RESUMO

The detailed analysis and structural characterization of proteoforms by top-down proteomics (TDP) has gained a lot of interest in biomedical research. Data-dependent acquisition (DDA) of intact proteins is non-trivial due to the diversity and complexity of proteoforms. Dedicated acquisition methods thus have the potential to greatly improve TDP. Here, we present FLASHIda, an intelligent online data acquisition algorithm for TDP that ensures the real-time selection of high-quality precursors of diverse proteoforms. FLASHIda combines fast charge deconvolution algorithms and machine learning-based quality assessment for optimal precursor selection. In an analysis of E. coli lysate, FLASHIda increases the number of unique proteoform level identifications from 800 to 1500 or generates a near-identical number of identifications in one third of the instrument time when compared to standard DDA mode. Furthermore, FLASHIda enables sensitive mapping of post-translational modifications and detection of chemical adducts. As a software extension module to the instrument, FLASHIda can be readily adopted for TDP studies of complex samples to enhance proteoform identification rates.


Assuntos
Proteoma , Proteômica , ATPases Transportadoras de Cálcio do Retículo Sarcoplasmático/antagonistas & inibidores , Proteínas de Ligação a DNA/metabolismo , Escherichia coli/metabolismo , Coração , Peptídeos , Processamento de Proteína Pós-Traducional , Proteoma/metabolismo , Proteômica/métodos
14.
BMC Neurol ; 22(1): 238, 2022 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-35773640

RESUMO

BACKGROUND: Stroke is one of the most frequent diseases, and half of the stroke survivors are left with permanent impairment. Prediction of individual outcome is still difficult. Many but not all patients with stroke improve by approximately 1.7 times the initial impairment, that has been termed proportional recovery rule. The present study aims at identifying factors predicting motor outcome after stroke more accurately than before, and observe associations of rehabilitation treatment with outcome. METHODS: The study is designed as a multi-centre prospective clinical observational trial. An extensive primary data set of clinical, neuroimaging, electrophysiological, and laboratory data will be collected within 96 h of stroke onset from patients with relevant upper extremity deficit, as indexed by a Fugl-Meyer-Upper Extremity (FM-UE) score ≤ 50. At least 200 patients will be recruited. Clinical scores will include the FM-UE score (range 0-66, unimpaired function is indicated by a score of 66), Action Research Arm Test, modified Rankin Scale, Barthel Index and Stroke-Specific Quality of Life Scale. Follow-up clinical scores and applied types and amount of rehabilitation treatment will be documented in the rehabilitation hospitals. Final follow-up clinical scoring will be performed 90 days after the stroke event. The primary endpoint is the change in FM-UE defined as 90 days FM-UE minus initial FM-UE, divided by initial FM-UE impairment. Changes in the other clinical scores serve as secondary endpoints. Machine learning methods will be employed to analyze the data and predict primary and secondary endpoints based on the primary data set and the different rehabilitation treatments. DISCUSSION: If successful, outcome and relation to rehabilitation treatment in patients with acute motor stroke will be predictable more reliably than currently possible, leading to personalized neurorehabilitation. An important regulatory aspect of this trial is the first-time implementation of systematic patient data transfer between emergency and rehabilitation hospitals, which are divided institutions in Germany. TRIAL REGISTRATION: This study was registered at ClinicalTrials.gov ( NCT04688970 ) on 30 December 2020.


Assuntos
Reabilitação do Acidente Vascular Cerebral , Acidente Vascular Cerebral , Humanos , Medicina de Precisão , Estudos Prospectivos , Qualidade de Vida , Recuperação de Função Fisiológica/fisiologia , Acidente Vascular Cerebral/complicações , Reabilitação do Acidente Vascular Cerebral/métodos , Extremidade Superior
15.
Methods Mol Biol ; 2500: 145-157, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35657592

RESUMO

Mass deconvolution, the determination of proteoform precursor and fragment masses, is crucial for top-down proteomics data analysis. Here we describe the detailed procedure to run FLASHDeconv, an ultrafast, high-quality mass deconvolution tool. Both spectrum- and feature-level deconvolution results are obtainable in various output formats by FLASHDeconv. FLASHDeconv is runnable in different environments such as the command line and OpenMS workflows.


Assuntos
Análise de Dados , Proteômica , Espectrometria de Massas , Proteômica/métodos
16.
Stud Health Technol Inform ; 294: 674-678, 2022 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-35612174

RESUMO

COVID-19 has challenged the healthcare systems worldwide. To quickly identify successful diagnostic and therapeutic approaches large data sharing approaches are inevitable. Though organizational clinical data are abundant, many of them are available only in isolated silos and largely inaccessible to external researchers. To overcome and tackle this challenge the university medicine network (comprising all 36 German university hospitals) has been founded in April 2020 to coordinate COVID-19 action plans, diagnostic and therapeutic strategies and collaborative research activities. 13 projects were initiated from which the CODEX project, aiming at the development of a Germany-wide Covid-19 Data Exchange Platform, is presented in this publication. We illustrate the conceptual design, the stepwise development and deployment, first results and the current status.


Assuntos
COVID-19 , Atenção à Saúde , Alemanha , Hospitais Universitários , Humanos , Disseminação de Informação
17.
Mol Biol Evol ; 39(6)2022 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-35578825

RESUMO

Human expansion in the course of the Neolithic transition in western Eurasia has been one of the major topics in ancient DNA research in the last 10 years. Multiple studies have shown that the spread of agriculture and animal husbandry from the Near East across Europe was accompanied by large-scale human expansions. Moreover, changes in subsistence and migration associated with the Neolithic transition have been hypothesized to involve genetic adaptation. Here, we present high quality genome-wide data from the Linear Pottery Culture site Derenburg-Meerenstieg II (DER) (N = 32 individuals) in Central Germany. Population genetic analyses show that the DER individuals carried predominantly Anatolian Neolithic-like ancestry and a very limited degree of local hunter-gatherer admixture, similar to other early European farmers. Increasing the Linear Pottery culture cohort size to ∼100 individuals allowed us to perform various frequency- and haplotype-based analyses to investigate signatures of selection associated with changes following the adoption of the Neolithic lifestyle. In addition, we developed a new method called Admixture-informed Maximum-likelihood Estimation for Selection Scans that allowed us test for selection signatures in an admixture-aware fashion. Focusing on the intersection of results from these selection scans, we identified various loci associated with immune function (JAK1, HLA-DQB1) and metabolism (LMF1, LEPR, SORBS1), as well as skin color (SLC24A5, CD82) and folate synthesis (MTHFR, NBPF3). Our findings shed light on the evolutionary pressures, such as infectious disease and changing diet, that were faced by the early farmers of Western Eurasia.


Assuntos
Fazendeiros , Migração Humana , Agricultura , DNA Antigo , DNA Mitocondrial/genética , Europa (Continente) , Genética Populacional , História Antiga , Humanos
18.
BMC Bioinformatics ; 23(1): 139, 2022 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-35439941

RESUMO

BACKGROUND: With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathway is functionally involved in the biological process under study. De novo identification of these pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, the definition and efficient identification of these pathways de novo from large biological networks is a challenging problem. RESULTS: We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation given a certain probabilistic model for de-novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We can show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset we can show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software. CONCLUSION: The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis generation tool contextualizing omics data within biomolecular networks.


Assuntos
Algoritmos , Software , Viés , Humanos , Modelos Estatísticos
19.
Nat Commun ; 13(1): 1347, 2022 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-35292629

RESUMO

The extraction of meaningful biological knowledge from high-throughput mass spectrometry data relies on limiting false discoveries to a manageable amount. For targeted approaches in metabolomics a main challenge is the detection of false positive metabolic features in the low signal-to-noise ranges of data-independent acquisition results and their filtering. Another factor is that the creation of assay libraries for data-independent acquisition analysis and the processing of extracted ion chromatograms have not been automated in metabolomics. Here we present a fully automated open-source workflow for high-throughput metabolomics that combines data-dependent and data-independent acquisition for library generation, analysis, and statistical validation, with rigorous control of the false-discovery rate while matching manual analysis regarding quantification accuracy. Using an experimentally specific data-dependent acquisition library based on reference substances allows for accurate identification of compounds and markers from data-independent acquisition data in low concentrations, facilitating biomarker quantification.


Assuntos
Metabolômica , Biomarcadores , Espectrometria de Massas , Metabolômica/métodos , Fluxo de Trabalho
20.
J Proteome Res ; 21(4): 1204-1207, 2022 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-35119864

RESUMO

Machine learning is increasingly applied in proteomics and metabolomics to predict molecular structure, function, and physicochemical properties, including behavior in chromatography, ion mobility, and tandem mass spectrometry. These must be described in sufficient detail to apply or evaluate the performance of trained models. Here we look at and interpret the recently published and general DOME (Data, Optimization, Model, Evaluation) recommendations for conducting and reporting on machine learning in the specific context of proteomics and metabolomics.


Assuntos
Metabolômica , Proteômica , Aprendizado de Máquina , Metabolômica/métodos , Proteômica/métodos , Espectrometria de Massas em Tandem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...